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Abstract 

A rotation-based synthesis framework for reversible logic is proposed. We develop a canonical rep- 
resentation based on binary decision diagrams and introduce operators to manipulate the developed 
representation model. Furthermore, a recursive functional bi-decomposition approach is proposed to 
T-H automatically synthesize a given function. While Boolean reversible logic is particularly addressed, 

our framework constructs intermediate quantum states that may be in superposition, hence we com- 
bine techniques from reversible Boolean logic and quantum computation. The proposed approach 
^ results in quadratic gate count for multiple-control Toffoli gates without ancillae, linear depth for 

quantum carry-ripple adder, and quasilinear size for quantum multiplexer. 

1 Introduction 

^ The appeal for research on quantum information processing [I] is due to three major reasons. (1) Working 

with information encoded at the atomic scale such as ions and even elementary particles such as photons is 
pT^I a scientific advance. (2) Direct manipulation of quantum information may create new capabilities such as 

^ ultra- precise measurement [2], telemetry, and quantum lithography [3], and computational simulation of 

^ quantum-mechanical phenomena [4]. (3) Some time-exponential computational tasks with non-quantum 

input and output have efficient quantum algorithms [1]. Particularly, most quantum circuits achieve a 
quantum speed-up over conventional algorithms [S]. However, useful applications remain limited. 
^ Recent advances in fault-tolerant quantum computing decrease per-gate error rates below the thresh- 

old estimate [5] promising larger quantum computing systems. To be able to do efficient quantum 
computation, one needs to have an efficient set of computer-aided design tools in addition to the ability 
of working with favorable complexity class and controlling quantum mechanical systems with a high 
fidelity and long coherence times. This is comparable with the classical domain where a Turing machine, 
a high clock speed and no errors in switching were not adequate to design fast modern computers. 

Quantum circuit design with algorithmic techniques and CAD tools has been followed by several 
I researchers. The proposed methods either addressed permutation matrices [7] or unitary matrices, e.g., 

[8j. Permutation matrices and reversible circuits are an important class of computations that should 
.J^ be efficiently performed for the purpose of efficient quantum computation. Indeed, Boolean reversible 

circuits have attracted attention as components in several quantum algorithms including Shor's quantum 
^ factoring [51 [TU] and stabilizer circuits [TT] . 

In this paper, a canonical decision diagram-based representation is presented with novel techniques for 
synthesis of circuits with binary inputs. This work may be considered along with the work done for the 
synthesis of reversible circuits [7]. However, we work with rotation-based gates which allow computing a 
Boolean function by leaving the Boolean domain [T^] . Therefore, this approach may be viewed as a step 
to explore synthesis of reversible functions by gates other than generalized Toffoli and Fredkin gates. 
We show that applying the proposed approach improves (1) circuit size for multiple-control Toffoli gates 
from exponential in [13l Lemma 7.1] to polynomial and from 48n^ +0{n) [131 Lemma 7.6] to 2n^ + 0{n), 
(2) circuit depth for quantum carry- ripple adders by a constant factor compared to |14j . and (3) circuit 
size for quantum multiplexers from 0{n^) to 0(n log^ n). 
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The reminder of this paper is organized as follows. In Section[2] we touch upon necessary background 
in reversible and quantum circuits. Readers familiar with quantum circuits may ignore this section. 
Section [3] summarizes the previous work on quantum and reversible circuit synthesis. In Section [4] the 
proposed rotation-based technique is described. In Section [Sj we provide an extension of the proposed 
synthesis algorithm to handle a more general logic functions, i.e., functions with binary inputs and 
arbitrary outputs. Synthesis of several function families are discussed in Section [6j and finally Section [7| 
concludes the paper. A partial version of this paper was presented in [15] . 

2 Basic Concepts 

A quantum bit, qubit, can be realized by a physical system such as a photon, an electron or an ion. In 
this paper, we treat a qubit as a mathematical object which represents a quantum state with two basic 
states |0) and |1). A qubit can get any linear combination of its basic states, called superposition, as 
= a\0) + where a and (3 are complex numbers and |ap + |/3p = 1. 

Although a qubit can get any linear combination of its basic states, when a qubit is measured, its 
state collapses into the basis |0) and |1) with the probability of jap and |/3p, respectively. It is also 
common to denote the state of a single qubit by a 2 x 1 vector as [ a /? ]^ in Hilbert space H where 
superscript T stands for the transpose of a vector. A quantum system which contains n qubits is often 
called a quantum register of size n. Accordingly, an n-qubit quantum register can be described by 
an element \^p) = \ipi) €5 IV'2) (8) ... \ipn) (simply |V'iV'2 • • "0")) in the tensor product Hilbert space 
iJ = i/i (8) i/2 «! • • • «) i/„- 

An n-qubit quantum gate performs a specific 2" x 2" unitary operation on selected n qubits. A matrix 
U is unitary if UU^ — I where is the conjugate transpose of U and / is the identity matrix. The 
unitary matrix implemented by several gates acting on different qubits independently can be calculated 
by the tensor product of their matrices. Two or more quantum gates can be cascaded to construct a 
quantum circuit. For a set of k gates gi,g2, - ■ ■ ,gk cascaded in a quantum circuit C in sequence, the 
matrix of C can be calculated as MkMk~i ■ ■ ■ Mi where Mi is the matrix of the i*^ gate (1 < i < k). For 
a quantum circuit with unitary matrix U and input vector ipi, the output vector is '02 = Uipi. 

Various quantum gates with different functionalities have been introduced. The 0-rotation gates 
(0 < 9 < 27r) around the x, y and z axes acting on one qubit are defined as Eq. ([T]). The single-qubit 
NOT gate is described by the matrix X in Eq. The CNOT (controlled NOT) acts on two qubits 
(control and target) is described by the matrix representation shown in Eq. ([2]). The Hadamard gate, 
H, has the matrix representation shown in Eq. ([2]). 



X = ° J ) , CNOT 

Given any unitary U over m qubits \x1X2 ■ ■ ■ Xm), a controlled-[/ gate with k control qubits \y1y2 • • ■ yk) 
may be defined as an (to + fc)-qubit gate that applies U on \x1X2 ■ ■ • Xm) iff \y1y2 ■ ■ ■ 2/A:) = |11 • • • 1). For 
example, CNOT is the controUcd-NOT with a single control, Toffoli is a NOT gate with two controls, and 
CRx{0) is a Rx{0) gate with a single control. Similarly, a multiple-control Toffoli gate C'^NOT is a NOT 
gate with k controls. Fig. [l] shows CNOT and Toffoli gates. For a circuit implementing a unitary U , it 
is possible to implement a circuit for the controlled- ?7 operation by replacing every gate by a controlled 
gate. In circuit diagrams, • is used for conditioning on the qubit being set to value one. 

3 Previous Work 

Synthesis of 0-1 unitary matrices, also called permutation, has been followed by several researchers during 
the recent years. Here, we review the recent approaches with favorable results. More information can 
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Figure 1: CNOT (a) and Toffoli (b) gates. Decomposition of a TofFoli gate into 2-qubit gates (c) where 
V ^ + iX)/2 [13]. 
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Figure 2: New definitions for CNOT (a) and Toffoli (b) gates using controlled rotation gates. Decom- 
position of a Toffoli gate into 5 2-qubit controlled-rotation gates (c). 



be found in [7]. Transformation-based methods [16 iteratively select a gate to make a given function 
more similar to the identity function. These methods construct compact circuits mainly for permutations 
with repeating patterns in output codewords. Search-based methods [17, explore a search tree to find a 
realization. These methods are highly useful if the number of circuit lines and the number of gates in 
the final circuit are small. Cycle-based methods |18j decompose a given permutation into a set of disjoint 
(often small) cycles and synthesize individual cycles separately. These methods are mainly efficient for 
permutations without repeating output patterns. BDD-based methods |19| use binary decision diagrams 
to improve sharing between controls of reversible gates. These techniques scale better than others. 
However, they require a large number of ancilla qubits. 

Quantum-logic synthesis deals with general unitary matrices and is more challenging than reversible- 
logic synthesis. Synthesis of an arbitrary unitary matrix from a universal set of gates including onc-qubit 
operations and CNOTs has a rich history. Barenco et al. in 1995 showed that the number of CNOT 
gates required to implement an arbitrary unitary matrix over n qubits was 0(n^4"). As of 2012, the 
most compact circuit constructions use ||4" - §2" -I- | CNOTs [HIISO] and i4" ^2" - n - 1 one-qubit 
gates [21 . The sharpest lower bound on the number of CNOT gates is [1(4" — 3n — 1)] [22j. Different 
trade-offs between the number of one-qubit gates and CNOTs are explored in [23] . 



4 Rotation-Based Synthesis of Boolean Functions 



In this section, we address the problem of automatically synthesizing a given Boolean function / by using 
rotation and controlled-rotation gates around the x axis. In this paper, we change the basis states as 
= [ 1 ] and 1 = i?a;(7r)0 = [ — * ] ■ With this definition of and 1, the basis states remain 
orthogonal. Also, inversion (i.e., the NOT gate) from one basis state to the other is simply obtained by 
a RxM gatej^ Subsequently, the CNOT gate can be described by using the CRx{tt) operator shown in 
Fig. |2la). In addition, the Toffoli gate may be described by using the C'^Rx{tt) operator illustrated in 
Fig. [2jb). Toffoli gate can be implemented using 5 controlled-rotation operators as demonstrated in Fig. 
[2][c). Recall that a 3-qubit Toffoli gate needs 5 2-qubit gates if |0) and |1) are used as the basis states 
(Fig. He)). 

For a 2-qubit CRx{0) gate with a control qubit a and a target qubit b, the first output is equal to 
a. However, the second output depends on both the control line a and the target line b. We use the 
notation aRx{0)b to describe the second output. Furthermore, we write Rx{0)b to unconditionally apply 



^While we used 6 and 1 as the basis states, the presented algorithm can be easily modified to be applicable to quantum 
functions described in terms of |0} and An alternate solution is to define the following operators and use M to transform 
the |0} and |1) states to and 1 states and operator to perform the reverse transformation. Hence to compute in |0) 

and |1) basis, one needs to apply M and single-qubit operators before and after the computation done in the and 

1 basis, respectively. Notice that M and are rotations around the z axis. 



M : 







1 
i 



3 



a single-qubit Rx{d) to the qubit h. Additionally, one can show that for binary variables a, h, c we have 
ai?,(0i)[ai?,(02)6] = aRx{ei+e2)b, aR,{0i) [bRx{02)c] = bR,{92) [ai?x(^i)c], aR^in)! a is used 
for negation), and ai?^. (7r)0 = a. 

Definition 4.1 and all variables are in the rotation-based factored (factored in short) form. If h and 
g are in the factored form, then Rx{9)h and gRx{0)h are in the factored form too. 

In a quantum circuit synthesized with Rx{0) and CRx{0) operators, all outputs and intermediate 
signals in the given circuit can be described in the factored form. For example, the output r in Fig. [2jc) 
can be described as r = [aRx{'K)b] Rx{—tt/2) [aRx{n/2) [bRx{n/2)c\]. 

Definition 4.2 and all variables are rotation-based cascade (cascade in short) expressions. If is a 
cascade expression and w is a variable ^ h, then Rx{9)h and vRx{0)h are cascade expressions too (V6'). 

A cascade expression can be expressed as Rx{Oo) [viRx{Oi) [v2Rx{d2) ■ ■ ■ [vnRx{0n)&\ ] ] ■ The prob- 
lem of realizing a function with RxiO) and CRxiO) operators is equivalent to finding a cascade expression 
for the function. To do this, we first introduce a graph-based data structure in the form of a decision 
diagram for representing functions. 



4.1 A Rotation-Based Data Structure 

The concept of binary decision diagram (BDD) was first proposed by Lee [55] and later developed by 
Akers |25j and then by Bryant (26j, who introduced Reduced Ordered BDD (ROBDD) and proved its 
canonicity property. Bryant also provided a set of operators for manipulating ROBDDs. In this paper, 
we omit the prefix RO. BDD has been extensively used in classical logic synthesis. Furthermore, several 
variants of BDD were also proposed for logic synthesis [12], verification |27| [53] and simulation 
[301 131| of reversible and quantum circuits. In this section, we describe a new decision diagram for the 
representation of functions based on rotation operators. Next, we use it to propose a synthesis framework 
for logic synthesis with rotation gates. 

Definition 4.3 A Rotation-based Decision Diagram (RbDD) is a directed acyclic graph with three types 
of nodes: a single terminal node with value 0, a weighted root node, and a set of non-terminal (internal) 
nodes. Each internal node represents a function and is associated with a binary decision variable with two 
outgoing edges: a weighted 1-edge (solid line) leading to another node, the 1-child, and a non-weighted 
0-edge (dashed line) leading to another node, the 0-child. The weights of the root node and 1-edges are 
in the form of Rx{d) matrices. We assume that — tt < 9 < tt. When a weight (either for an edge or the 
root node) is the identity matrix (i.e., Rx{0) — I), it is not shown in the diagram. 



The left RbDD in Fig. 3(a) shows an internal node / with decision variable a, the corresponding 
and 1 edges, and child nodes /o and /i. The relation between the RbDD nodes in this figure is as 
follows. If a = 1, then / =^a;(^)/i else / — Jq. In addition, if / is a weighted root node as shown in 



the right RbDD in Fig. 3(a) then for o = 1 we have / = Rx{dr)Rx{0)fi — Rxi^r + d)fi', otherwise 
/ = Rx{dr)fo- Similar to BDDs, in RbDDs isomorphic sub-graphs which are nodes with the same 
functions are merged. Additionally, if the 0-child and the 1-child of a node are the same and the weight 
of 1-edge is Rx{0) — I, then that node is eliminated. Using these two reduction rules and a given total 
ordering -< on input variables, one can uniquely construct the RbDD of a given function. Notably, a 
decision diagram called DDMF was proposed in where each edge can represent any unitary matrix 
including rotation operators. DDMF was used for verification of quantum circuits. 

For a given function / with n binary variables vi,V2, ■ ■ • ,Vn, each value assignment to wi, z;2, • ■ ■ ,Vn 
corresponds to a path from the root to the terminal node in the RbDD of /. Assuming the variable 
ordering vi < V2 < ■ ■ ■ < Vn, the corresponding path can be identified by a top-down traversal of 
the RbDD starting from the root node. For each node visited during the traversal, we select the edge 
corresponding to the value of its decision variable Vi. Denote the weight of the root node by wq and 
the weight of the selected edges by wi, W2, • • • , Wn~i- We have /(wi, V2, - ■ ■ , Vn) = wqWi ■ ■ ■ w„-iO = 
wqWi ■ ■ -Wn-i [ 1 ] . If a 0-edge is selected for variable Vi (i.e., if Vi = 0), we have Wi = I. Note 
that when the 0-child and the 1-child of a node / are the same node g, then that node can be directly 
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Figure 3: (a) Internal structure of a rotation-based decision diagram (RbDD) without and with a weighted 
root, (b) For a node /, if the 0-child and the 1-child are the same node g, f can be directly realized by 
a Rx{0) operator as shown in (c). 
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Figure 4: RbDDs for intermediate signals of a 3- input Toffoli gate shown in Fig. [2jc), redrawn in (b). 
In this figure, we have qi = aRx{'K)h, ri — aRx{TT/2)r2, and r2 = bRx{'^/'2)c. 



realized by a Rxi^) operator, as / = aRx{0)g demonstrated in Fig. |3(b) and Fig. 3(c) in terms of its 



child. Fig. |4(a) shows the RbDDs of functions pi, qi and ri in Fig. 2|c) (reproduced in Fig. [4(b)] ). 



Every RbDD with a chain structure such as the ones shown in Fig. |4(a) is associated with a cascade 



expression and can be realized with rotation and controlled- rotation operators. 

Suppose that the RbDD for a function / is given. The RbDD for h — Rx{'j)f can be obtained by 
multiplying the root weight of / by R^i-f). To obtain h = fRx{l)g for given RbDDs of / and g, we 
use the apply operatorj^ In this context, / and g are called RbDD operands of h. The apply operator 
is implemented by a recursive traversal of the two RbDD operands. For each pair of nodes in / and g 
visited during the traversal, an internal node is added to the resulting RbDD by utilizing the following 
rules which depend on the selected variable ordering -< (also see Fig. [5|. We assume that / and g have 
two general RbDDs shown in Fig 
conditions ORx{0)v — v and lRx{d)v 



5(a; 



The apply operator is recursively called with the terminal 

;rx{o)v. 

After recursive computation of the 1-child and 0-child of h, to maintain the canonicity of the resulting 
RbDD, isomorphic sub-graphs are merged and if the 0-child and the 1-child of a node are the same and 
the weight of the 1-edge is Rx{0) — I, then that node will be eliminated. In addition, to make RbDD 
of h canonical, the resulting weights for the 1-child and the 0-child of h should be modified by the 
method illustrated in Fig. 5(c) Fig. 6(a)| dem onstrates the result of performing apply operator on 
qi and ri in Fig. 4(a)[ redrawn in Fig! |6(a) to obtain r = qiRx{—n/2)ri. To construct RbDD 
for r, one needs to initially apply Rule 3 because both qi and ri use a as roots. Accordingly, wi = 
[Rx{Tr)b]Rx{—Tr /2)[Rx{tt /2)r2] and wq — bRx{—Tr/2)r2. To continue, consider wi and note that both 



^In general, for a binary operation op and two BDDs of functions / and g, the apply operator computes a BDD for 
/opg Mi- 
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• Rule 1 {a < b) The new node for h is a. The weights of l-child and 0-child are [Rx(ctr + 
a)fi]Rxh)9, and [Rx{ar)fo]Rxil)g, respectively. 

• Rule 2 {b < a) The new node for h is b. The weights of l-child and 0-child are fR^ (7) [Rx {Pr + 

I3)gi], and f Rx{l)[Rx{Pr)go\, respectively. 

• Rule 3 (a = 6) The new node for h \s a (or h). The weights of l-child and 0-child are 

[Rx{ar + a)fi]Rx{j)[Rx{l3r + /3)gi], and [Rx{ar)fo]Rx{7)[Rx{l3r)go], respectively 
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Figure 5: (a) Operands for operation h = fRx{"i)g- (b) The result of apply operator which adds 
a new node to the resulting RbDD h by using one of the three rules: if a < (Rule 1), v — a, 

wi = [Rx{ar+a)fi]Rx{-f)g, wq = [Rx{ar)h]Rx{n)g- lib < a (Rule 2), v^b,wi= fRx{j)[Rx{/3r+P)gi], 
■Wo = fRx{l)[Rx{(3r)go]- If a = & (Rule 3), v = a ^ b, wi = [Rxi^r + a)fi]Rx{'y)[Rx{Pr + I3)gi], 
Wq = [Rx{ar) fQ]Rx{^)[Rx{Pr)go\- (c) Weight modification for the apply operator to maintain canonicity 
of the resulting RbDD. 



[RxiTr)b] and [Rx{T:/2)r2] use fej^ As a result, applying Rule 3 leads to = [Rx{Q)0]Rx{—'n' /2)[Rx{Tr /2+ 
7r/2)c] and wifi — [Rx{TT)0]Rx{—TT/2)[Rx{Tr/2)c]. On the other hand, applying Rule 3 on wq leads to 
wq i = [Rx{Tr)0]Rx{~Tr /2)[Rx{tt /2)c] and wq q — [Rx{Q)0]Rx{—Tr /2)c. Using terminal conditions results 
in wi.i — RxMc, wi,Q = c, wq,i — c, and wo.q = c. Since wo.i — wo,o = c, we can remove variable b as 

is obtained after eliminating redundant nodes and edgesrl 



the d-child of a. The final figure in Fig. 6(a) 



4.2 Functional Decomposition and r-Linearity 

The problem of realizing a function / using Rx{0) and CRx{0) operators is equivalent to finding a 
rotation-based factored form for /, which can be performed by recursive bi-decomposition of /. 

Definition 4.4 Rotation-based bi-decomposition (bi-decomposition in short) of / is defined as finding 
functions g and h and value 7 such that / = gRx{'y)h. 

We use bi-composition of a given function / to construct / = gRx{j)h. Subsequently, g and h are 
recursively bi-decomposed, which will eventually result in a factored form of /. The bi-decomposition 
algorithm is based on the notion of r-linearity. 

Definition 4.5 For function /(wi, ■ • • , Wi-i, Vi, Wi+i, • • ■ , v„), variable Vi is r-linear if there exists a ro- 
tation value 9i such that for every value assignment to wi, • • • , Ui+i, ■ ■ ■ ,Vn ■ fvi = Rx{Oi)fvi , where 
fvi = /(«!,■ • • , i, Vi+1, ■■■ ,Vn) and fy^ = f{vi, ■ ■ ■ , Wi-i, 6, Wj+i, • • • ,-;;„). A variable is r-nonlinear 
if it is not r-linear. 

Now we present a number of key results. 

^To understand the RbDDs of [Rx{TT)b] and [Rx{n/2)r2], recall RbDDs of b and r2 in Fig. |6(a)| and use weights RxiTr) 
and Rx{it/2) for roots of b and r2, respectively. 

*Note that the commutative property of matrix multiplication for Rx{d) matrices is critical for the apply operator. 
Performing apply as described may not generate the correct result for decision diagrams with non-commutative weights. 
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Figure 6: (a) An example of performing apply operator on two RbDDs. In the first call of the apply 
operator, wi = [Rx{'n-)b]Rj;{—Tr/2)[Rx{n/2)r2] and wq = bRxi—ir /2)r2- The final figure is obtained 
after eliminating redundant nodes and edges, (b) A general RbDD structure with r-linear variables 

Wfc+l, • • • ,Vn- 



Lemma 4.6 Consider a function /(vi, W2, ■ ■ • , v^) with variable ordering vi < V2 < ■ ■ ■ < Vn and assume 
that k + 1 < i < n. Iff each variable Vi is r-linear, then there is only one RbDD node rij for each r-linear 
decision variable Vi. The weight of the 1-edge of rii is Rx{Oi). 

Proof. The proof is by induction on w„, Vn-2, ■ ' ' > Vk+i starting from Vn- | 

Let Vk be the lowest indexed r- nonlinear variable after which Vk+i, Vk+2, • • • are r-linear variables 



of /. From Lemma 4.6, for k + 1 < j < n we have /y^ — Rx{()i)Ivj where 9j is fixed independent 



of vi,t'2,''' iVj-iTVj+i, - ■ ■ ,Vn values. As illustrated in Fig. 6(b) every path from the root node of 
the RbDD to the terminal node will either go through an internal node with decision variable Vk or 
it will skip any such node and directly go the single RbDD node with decision variable Vk+i- For 
the latter case, /t,^ = Rx{^)fv^. — fv^ and for any former case fy^ = Rx{o-i)fvk for some (vs. all) 
vi,--- , tife-i, Ufe+i, • • • ,f„. Additionally, the number of different rotation angles (e.g., ai,a2 in Fig. 
|6(b)[ ) for variable Vk is equal to the number of internal nodes with decision variable Vk in the RbDD. 

Definition 4.7 The degree of r-nonlinearity of variable Vk, r-deg(wj), is m — 1 where m is the number of 
different rotation angles ai (including if any) that fv^. — Rx{ai)fv^ for some wi, • • • , Vk-i, Vk+i, • • • , Vn- 
For r-linear variables the degree of r-nonlinearity is zero. 



As an example, consider the RbDD of r in Fig. |6(a) and note that r-deg(6) = 1 as there are two 



rotation angles (i.e., and tt) for b. Similarly, r-deg(c) = and c is r-linear. 

Lemma 4.8 Let m denote the number of internal nodes with decision variable Vk- If all paths from 
the root node of the RbDD to the terminal node go through an internal node with decision variable Vk, 

r-deq(ufc) = m — 1; otherwise r-deg(ufc) — m. 

Proof. The proof follows from considering the general structure of RbDDs and the definition of 
r-nonlinearity. | 

Theorem 4.9 Consider a function /(wi, W2, • • • , Vn) with variable ordering vi < V2 < ■ ■ ■ < Vn- Define 
g such that if /^^ = Rx{oii)fvk then g = 1] otherwise g = 0. Assume that Wfc+i,Wfe+2, • ■ ■ ,Vn are r-linear 
variables of / and Vk is a r-nonlinear variable of / with r-deg(ufe) = m — 1. Additionally, for each value 
assignment to variables wi, • ■ ■ , Wfe-i, w/c+i, ■ • • ,Vn suppose exactly one of the following m relations holds: 
fvk = Rx{ai)fvk, fvk = Rx{a2)fvk, ■■■ , fvk = Rx{am)fvk- We have 

I / can be bi-decomposed a.s f — giRxij)h where gi — VkRx{T^)g, 7 — {0L2 — Q!i)/2, h = 5ii?x(~7)/- 
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Algorithm 1 factor (/) 



1; If all variables are r-linear, then return the corresponding cascade expression for /. 

2: Find the lowest indexed r-nonlinear variable Vk after which Vk+i, Vk+2, • • • ,Vn are r-linear. 



3: Bi-decompose / using Vk as f = giRx{'^)h where gi, h, and 7 are given in Theorem 4.9 
4: Return [factor (gi)]i?a;(7)[f actor (/i)]. 



II gi is a function of wi, V2, ■ ■ ■ , Vk, i.e., gi is invariant with respect to Vk+i, i'fe+2, • • • , Vn- 

III Vk is a r-linear variable of gi. 

IV h is a function of vi, V2, ■ ■ ■ ,Vn and Ufc+i, Vk+2, ■ • • ,Vn are r-linear variables of h. 
V r-deg(ufc) in ft, is < m — 2. 

Proof. We initially prove that function g is invariant with respect to Wfe+i, ffc+2j • • • ^v^, i.e., g^. = g^. 
for k -\- 1 < i < n. Since Vi is r-linear, there exists 9i such that for all ui, • • • , t^i+i, • • ■ , f n values, 
fvi = Rx{Oz)fvi which resuhs in = Rx{0i)fv,vk and /^.p^ = Rx{Qi)hiVk- From the definition of 5 
we have: 

• If = Rx{ai)fv,vk^ then g^,^ = i, else g^^ = 0. 

• If /s^t,^ = then g^. = i, else g^. = 6. 
Combining these relations proves gi,. = g^^ : 

fviVk = Rx{oil)fviVk Rx{Si)fviVk — Rxi^l + di)fviVk ^ hiVk = Rx{oil)fviVk 

Since gi = VkRx(T^)g, gi is also invariant with respect to Vk+i,Vk+2, ■ ■ ■ ,Vn (part II). Moreover 
givk = RxMg and gw^^ = g which results in giy,^ = Rx{TT)gi^^, i.e., Vk in gi is r-hnear (part III). 

The first sentence of part IV is clear from the definition of ft = giRx{—'y)f- As for the second 
one, note that Vk^i,Vk+2, ■ ' ' are r-linear variables of /. Additionally, g is invariant with respect to 
Vk+i,Vk+2i ■ ■ ■ ,Vn- Putting these facts together proves part IV. 

Now we prove r-deg(i'fe) < to — 2 in ft, = giRx{—j)f. For each value assignment to variables 
vi,V2, ■ ■ ■ ,Vk-i,Vk+i, ■ ■■ ,Vn exactly one of the following to relations holds: fy^ = Rx{ai)fy^, fy^ = 
Rx{a2)fvk, ■ ■ ■ , fvk ^ Rx{otm)hk- For each of the above cases, we examine the relation between ft^^. 
and hy^: 

• /iij. = Rx{(^i)fvk'- By definition .g = 1 and we have: 

hy^ = iRx{~j)fyk Rx{~l)fvk ^ = Rxh)hy^ 

hy^ = [RxiTT)i]Rxi-^)fvk = fvk = Rxiai)fvk = Rx{ai + j)hy^,j = (02 - ai)/2 ^ 

. _ D (ai+a2\i_ 

"■Ufe ~ ^^X\ 2 )"""k 

• fyf, — Rx{(^2)fvk'- By definition .g = and we have: 

hy^ = ORx{-l)fvk = hu 

hy^ = [Rx{T^)^]Rx{-l)fvk =Rx{-l)fvk = Rx{-J + a2)fvk,l^ {a2-ai)/2^ 

Ityk — llx\ 2 )"'Vk 

• fy^ = Rx[cii)hki 3 < i < m: By definition 5 = 6 and ft„^ = Rx{-J + ai)hy^. 

The first two cases result in the same relation between hy^ and hy^ as hy^ — Rx{ "'^^°'^ )hyf,. The 
remaining to — 2 cases result in at most to — 2 different relations between hy^ and hy^, . Therefore, the 
total number of different relations between ft„j. and hy^^ is < to — 1. Accordingly, r-deg(wfe) in ft is < to — 2 
(part V). 

Finally, from ft = giRx{—l)f it follows that giRx{j)h = giRx{'y) [giRx{~'j)f]- Consider gi = 
VkRx{'^)g and assume .g 1 (or g ~ 0) which leads to gi = VkRxi^^)^ Vk (or gi — Ufei?j;(7r)0 = Vk). 
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Algorithm 2 gi-f actor (RbDD/) 



1; Change all of the weights to Rx{0) = I- 

2: Create a RbDD node Vk with wi = Rx{tt) and wq = I to the terminal node (i.e., 0). 

3: Redirect all edges toward ni to node Vk and make the weight of all such edges Rx{tt)- 

4: Redirect all edges toward n2 , ris , • • ■ , n„j to node Vk and make the weight of all such edges RxiO)- 

5: Discard nodes n2, ns, • • • , rim- 

6: Merge isomorphic sub-graphs, eliminate nodes with the same 0-child and 1-child exactly if the weight 
of the 1-edge is Rx{0) — I- Update weights of the RbDD to make the RbDD of gi canonical. 



Altogether for both Vk = 1 and Vk = 6, we have giRx{"f) [giRx{—l)f] = /• Hence, / can be bi- 
decomposed as f = giRx{-f)h (part I). | 

Using the proposed bi-decomposition approach, / can be bi-decomposed into / = giRx{'y)h where 171 
and h are themselves recursively bi-decomposed until a rotation-based factored form is obtained. 

Theorem 4.10 The proposed bi-decomposition approach always results in a cascade expression for a 
given function /. 



Proof. Following the definitions given in Theorem |4.9| for / = giRx{'^)h, since gi is invariant of 
Vk+i,Vk+2, ■ ■ ■ ,Vn and Vk in g is r-linear and r-deg(wfe) in /i is < m — 2, the recursion will finally stop 
at terminal cases where gi and/or h have directly realizable RbDDs — all variables are r-linear in 
the functions and they have rotation-based cascade expressions corresponding to RbDDs with a chain 
structure]^ | 

Algorithm [T] uses the proposed recursive bi-decomposition approach to generate a rotation-based 
factored form for a given function /. All steps in Algorithm [T] can be directly performed on RbDDs. If 
the RbDD of a function / is a chain structure, we have a cascade expression for / (Step 1). For Step 2 as 
depicted in Fig. |6(b)| and according to Lemma 4.6 identifying Vk is equivalent to identifying the lower 



chain-structure part of the RbDD. As for Step 3, according to Lemma |4.8| values ai, 02, • • • , Um can be 
obtained from weights of the 1-edges of nodes with decision variables Vk- Hence, 7 — (a2 — ai)/2 is 
obtained. Let n,i{l < i < m) denote nodes with decision variable Vk and 1-edges weight Rx{on). Starting 
from the RbDD of /, one can perform Algorithm [2] to construct RbDD of gi. Having the RbDDs for 171 
and /, the RbDD of ft- = giRx{^l)f can be obtained by using the apply operation. As an example of 
Algorithm [2] see RbDDs of s and gi in Fig. |8]and Fig. [9] where Vk = c and m = 2. This example is 
described in detail in Section [6l 

The final form after apply is / = giRx{ji) [32^^(72) [gsRxils) ■ ■ ■ [gkRx{lk)0] ] ]■ Note that gi 
functions should also be decomposed. The factor algorithm is not optimal. In particular, / can 
be rewritten as / = gp^Rxilp^) [fe -^a: (7^2 ) [gps^xilpa) ■ ■ ■ [gp^RxilpjO] ] ] where (pi,p2, ■ ■ ■ ,Pk) is a 
permutation of 1, 2, • • • , fc. Different permutations of 1, 2, • • • , fc may result in different number of gates 
after synthesis. For example, consider the RbDDs of the output s in a 4-input Toffoli gate, shown in 
Fig. [sj for two different variable orderings a > b > c > d and a > b > d > c. In Fig. |8(b)[ d is r-linear. 



However, none of the variables in Fig. 8(c) are r-linear. Accordingly, the proposed approach results in 
fewer gates for a > 6 > c > d. The former case is further discussed in Section [6] Indeed, working with 
a > b > d > c leads to s = giRx{—Tr/2)h where gi = abd ® c is a 4-input Toffoli gate targeted on the 
last qubit c ioi a > b > d > c. 



5 Working with Arbitrary Outputs 

For the input vector U, a function / with binary inputs and outputs can be written as f{U) — 
gi{U)Rx{ji) [g2{U)Rx{'^2) [■ ■ ■ [gk{U)Rx{'^k)0]]]. Since functions gi{U) only take values and i, 
f{U) can also be represented as f{U) = Rx{gi{U)^i + g2{U)l2 + ■ ■ ■ + gk{U)jk)0 where gi{U) values 
are either zero (0) or one (1)Q Define 7(C/) = gi{U)^i + g2{U)j2 + ••• + gk{U)lk which lea ds to 



f{U) = Rx{j{U))0. Accordingly, the structure of the synthesized circuit can be represented as Fig. 7(a) 



^As a result of Lemma 



in a function with chain structured RbDD, all variables are r-linear. 



^To prove, assign arbitrary values and 1 to gi terms and consider the resulting rotations by different 7^ values. 
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(a) 



(b) 



(c) 



Figure 7: (a) Quantum circuit performing f{U) — Rx{'y(U))6. (b) Quantum 3-input multiplexer. Dashed 
boxes represent G, G' , and Rx{l) in (a). Only rotation angles are reported for Rx{9) gates, (c) Quantum 
circuit performing /([/) = R,X7{U))Rx{0{U))6. 
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Figure 8: 4-input TofFoli gate (a), the RbDD for the 4th output s in two cases: i{a>b>c>d (b), 
and if a > 6 > d > c (c). 



In this figure, G is a circuit that constructs 7(C/) and G' is the inverse of G. Note that G' should be 
used only if one wants to keep input lines unchanged. To clarify the roles of G and G , see the 3-input 



multiplexer circuit / = sxi + SX2 synthesized by the factor algorithm in Fig. 7(b) If instead of 0, 
another quantum value q is used in this circuit as the initial value for the input, the resulting circuit 
implements f{U) ~ R^ [j{U)] q. The constant ancilla register in Fig. |7(a) may not be necessary in some 
case. For example, the controlled rotation Rxi"^) with control qubit a and target generates a as the 
second output and the use of the controlled rotation Rxi"^) in this case is unnecessary (i.e., aRx{T^)^ = a)- 
Section [6] shows several examples. 

Now consider a given function that for given basis input vectors generates a general value f{U) = 
[ foiU) h{U) Since \h{U)\' + \h{U)\' = 1, we may rewrite /([/) as: 

f{U) = e'^(^) [ e-^^(^)/2 cos ^ -ze^^tf^)/^ sin ^ ] ^ 

Hence, f{U) can be expressed as f{U) — e^^^^^ Rz{^{U))Rx{0{U))d where R^ is the rotation operator 
around the z axis. We can ignore the global phase e^^^^^ since it has no observable effects [Tj. Therefore, 
one can effectively write f{U) = Rz{-f{U))Rx{0{U))6. Note that Rz{-/)Rx{9)d results from 9 rotation of 
around the x axis followed by 7 rotation around the z axis in the Bloch sphere. The quantum circuit 
for /([/) = Rz{j{U))Rx{9{U))d can be synthesized as: 

• Synthesize g{U) = Rx{9{U))0 by using the factor algorithm. 

• Synthesize h{U) — Rz{'^{U))0 by using the factor algorithm. 



• Cascade the resulting circuits as depicted in Fig. 7(c) In this figure, Gi and G2 are for g{U) and 
h(U), respectively. Accordingly, G'l and G'2 are the inverse circuits of Gi and G2. 
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Figure 9: RbDDs required to synthesize the 4-input TofFoh gate in Fig. |8] 



6 Results 



Multiple-control Toffoli gate. Consider a 4-input TofFoli gate in Fig. 8(a) and the RbDD of the 

target output in Fig. |8(b)| with variable ordering a < b < c < d. Comparing the RbDD of s with the 
general RbDD structure in Fig. |6(b)| reveals that variable c corresponds to Vk- Additionally, r-deg(c)=l, 
ai = and a2 — tt which result in 7 ~ tt/2 (Theorem 4.91 ^ 

Function s can be bi-decomposed as s = giRx(—TT/2)h where gi = cRx{'^)g- RbDDs for g and gi 
are shown in Fig. |9|a) and Fig. ^h), respectively!^ Note that gi is a 3- input ToffoH gate (see RbDD 
of r in Fig. 6(a)), which can be synthesized as in Fig. [2](c). As for function h, it can be written as 
h = giRx{n/2)s. The RbDD for h (by the apply operator) is shown in Fig. |9](c). Subsequently, h can 
be bi-decomposed as h = g2Rx{—'i^ / ^)hi where (72 = aRx{'r:)b (by algorithm [2| and hi = (72^x(7r/4)/i (by 
the apply operator). The resulting RbDDs for g2 and hi are shown in Fig.^|9]jd) and Fig. ^e). Finally, 
the factored form for s is s = giRx{—n/2)[g2Rx{—T''/'i)hi]. 

Due to the chain structure of 172 and hi, they may be directly realized by using controlled- rotation 
operators. Note that when realizing gi, we also implement g2- The final circuit is shown in Fig. [TO] The 
first subcircuit generates output s whereas the remaining gates generate outputs p, q and r. 

As a direct extension of the above approach, consider a multiple-control Toffoli gate on n + 1 
qubits with controls ii, 12, «„ and target j. Toffoli output can be written as j — «iZ2 • • • © j. Assume 
ii < 12 < ■ ■ ■ < in < j ■ It can be verified that Vk (in Algorithm [T]) is j„ and we have r-deg(i„) — 1 with 
ai = 0, a2 = TT, and 7 = tt/2 (in Theorem 4.9 1. Therefore, one can write j — giRx{—TT/2)h. It results 
in gi = 1112 • • • in-i ® in and h = [1112 ■ ■ ■ in~i\Rx(T^ l'2)[inRx{T^ Now, gi is an n-qubit Toffoli gate 
and can be decomposed independently following the same approach. To decompose h, one can verify 
that Vk = in-i in Algorithm [I] with r-deg(i„_i) — 1, ai = 0, a2 = 7i'/2, and 7 = 7r/4. Accordingly, we 
can write h — g2Rx{—TT/4)hi. Applying Algorithm [2] reveals that 172 is an (n— l)-qubit Toffoli gate with 
in-i as the target and ji,«2i ■ ' ' as controls. By using the apply operator, hi = g2Rx{'^/'^)h which 

leaves — in-2- Altogether, we can write: 



^One may set ai = n and a2 = 0. This combination generates a different circuit with the same functionality. 

*RbDD of g\ can be obtained by using Algorithm [2] and RbDD of s — no need to construct RbDD of g. However, an 
interested reader can verify that an indirect approach to construct RbDD of g (and hence the function of g) is to replace 
tifc by 6 in RbDD of g\ which is constructed from applying Algorithm jS] 
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Figure 10: Automatic synthesis of a 4-input Toffoli gate obtained by the factor algorithm. Only 
rotation angles are reported for Rx{S) gates. The first subcircuit generates output s in Fig. [s] whereas 
the remaining gates generate outputs p, q and r. 




t = abed © e 



Figure 11: Synthesized circuit for a 5-input Toffoli gate. Only rotation angles are reported for Rx{0) 
gates. The dashed subcircuit generates output for a 4-input Toffoli gate (see Fig. 10 1 and U^^ is the 
reverse of this subcircuit. 



C"Rx{tt) = C"-ii?,(^)i?,(-^/2) 



[C"-("-i)i?,(7r)i?,(-V2""i) 
[zii?,(7r/2"-i)fei?,(^/2"-i)fei?,(7r/2"-2)(. • • (^„i?,(7r/2)J) • • • )))] 



To construct the circuit, for [iii?2,(7r/2"^^)(i2i?2;(7r/2"^^)(- • • (i„i?^(7r/2)j) • • • ))] one needs to add 
n controUed-rotation gates with controls on ii, Z2, • • • , i„ and targets on j. This subcircuit should be 
followed by constructing a. gi — C^~^ Rx{it) gate which automatically constructs all 32 = C'^~'^Rx{'^), 
gs = C'^~^Rx{n), ■ ■ ■ , g^ — CRx{tt) gates too. Next, one needs to use n — 1 controUed-rotation gates 
with controls on 172, 53, • • • , <?n and targets on j. Altogether, we need COSTi cinot = 2n — 1 + 
COSTi cn-iNOT controUed-rotation gates to implement a C"NOT gate. To restore ii, «2, • • ■ , *ra qubits to 
their original values, additional cost should be applied which is C0ST2,C"N0t = COST^ gn-iNOT: i-S-, all 
gates excluding gates with targets on j. Terminal conditions are COSTi c^not = 4 and COST2_c2not = 
1 (see Fig. [2jc)). Total implementation cost is COSTcnNOT = COSTi^c"NOT + COST2,c"NOT which is 
polynomial, i.e., 2n^ — 2n + 1. Fig. 
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illustrates this construction for a 5-input Toffoli gate. No ancilla 
is required in the proposed construction. Current constructions for a C"NOT gate use an exponential 
number of 2-qubit gates 2"+^ — 3 [13, Lemma 7.1] or ASri^ + 0{n) arbitrary 2-qubit operations [T31 Lemma 
7.6], if no ancilla is available. | 

Quantum adder. Consider a full adder with inputs xi, X2, and {xi < X2 < X3) and outputs 
s = xi ® X2 ® X3 and c = X1X2 + X1X3 + X2X3. The RbDDs of s and c are shown in Fig. |12(a)| The 
RbDD of s has a chain structure that corresponds to a cascade expression and can be directly realized. 
On the other hand, the RbDD of c should be recursively decomposed by using Algorithm [l] Using this 
algorithm, c is bi-decomposed as c = giRx(—Tr/2)h. 
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To construct RbDD of gi note that Vk = x^. Applying Algorithm [2] leads to four internal nodes as 
follows. Node ni with the decision variable xi^ wi = wq = I, and 1-child node 712, and 0-child node 
713. Node with the decision variable X2, node weight Rx{t^), Wi — RxIt^), wq = /, and node 714 as 
both 1-child and 0-child. Node 713 with the decision variable X2, wi = Rx{t^), wq — I, and node 714 
as both 1-child and 0-child. Node 714 with decision variable X3 connected to the terminal node with 
wi = RxiT^), and Wq — I. A careful consideration reveals that this RbDD can be converted to the one 
constructed for s in Fig. 12(a) Therefore, gi has a cascade expression and a realizable rotation-based 
implementation. Finally, the RbDD for h — giR^iir /2)c is shown in Fig. 12(a) As can be seen, the 



RbDD of h has a chain structure too. The resulting quantum circuit is depicted in Fig. 12(b) 



Now consider a 2-qubit quantum adder with inputs ai, oq, hi, 60 for aq < 60 < cii < &i and 
outputs c, si, and sq for sq < si < c. It can be verified that sq = &o © ao,si — aobo ai 6i,c = 
aoboai © aobobi © aibi. Applying the above approach leads to the following equations: 

So = aoRx{'n-)bo 

si = giRx{-Tr/2)hi 

91 = So 

hi = aoRxin/2){boRxin/2){aiRxin)bi)) 
c = g2Rx{-T^/2)h2 

92 = Si 

h2 = 53^2;(-7r/4)/l3 

53 = So 

/73 = aoi?..(7r/4)(fooi?r.(7r/4)(aii?,(7r/2)(6ii?,(7r/2)6))) 

Therefore, sqjSi, and c can be implemented by one, four, and six 2-qubit gates (11 in total), respec- 
tively. The circuit uses one ancilla for c; ao,ai remain unchanged and si and so are constructed on bi 
and 60 1 respectively. 

To generalize, consider an n-qubit quantum ripple adder with inputs and &j and outputs Si 
and cforO<i<7i— 1 and oq < 60 < < ^1 < ■ ■ ■ < fln-i < ^n-i and So < si < • • • < Sn_i < c. We 
have: 

So = aoRx{'iT)bo 

si = SoRx{-n/2){aoRx{n/2){boRx{TT/2}{aiRx{n){biRx{n)Q)))) 



s„_i = s„-2Rx{~7r/2){sn-3RA-^/'^^){- ■ ■ [sM-^IT-^) 
(aoi?.(V2""')(&oi?.(V2""') 
{aiRx{^/r-^){bM^l2^-^) 

{a„-2Rx{Tr/2)ibn-2Rx{7r/2)) 

(a„_ii?,(^)(5„_ii?,(^)6)))---) 

c = s„_ii?,(-7r/2)(s„_2i?.(-V2')(- • • (soi?.(-V2") 
(aoi?.(^/2")(6oi?.(7r/2") 
{aiRx{n/2"-^){biRJn/2^-^) 



(a„_2i?.(^/4)(6„_2i?.(7r/4)) 

{an-lRx{TT/2){bn-lRx{n/2)6))) ■■■) 



To count the number of 2-qubit gates, note that there are 271 gates on c, 2n~ 1 gates on 27i — 3 

gates on bn^2, ■ • • , 3 gates on bi and 1 gate on 60 in the proposed construction. This subcircuit should 
be followed by a 2-qubit gate conditioned on 60 with target on 61, 2 gates conditioned on bg and bi with 
targets on &2, 3 gates conditioned on 60, 61, ^2 with targets on 64, etc. Altogether, an 77-qubit quantum 
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Figure 12: (a) RbDDs for a 3-qubit full adder and (b) the synthesized circuit. Only rotation angles are 
reported for Rx{9) gates. 



ripple adder can be implemented with l/2(3n^ + 5n) controlled-rotation gates. Fig. 13 illustrates the 
proposed construction for a 5-qubit carry-ripple adder. This circuit is restructured in Fig. 14 with 
parallel gates. Compared to the construction in [Ml Figure 7] with depth 28, our circuit uses a wider 
varieties of rotation angles to reduce the depth to 23. Circuit depth for n = 2, • • • , 15 is 9(10), 12(16), 
19(22), 23(28), 27(34), 31(40), 39(46), 43(52), 48(58), 51(64), 57(70), 61(76), 66(82), 70(88) where a{b) 
denotes a 2-qubit gates in the proposed construction and h 2-qubit gates in [150 | 

Quantum multiplexer. Consider a 3-input multiplexer / = sxi + sx2 with s < xi < X2. 



Following Theorem 4.9 leads to ai = 0,a2 — n,^ — 7r/2 and / = (7i_R^(7r/2)/i for — X2 with r- 
deg(a;2) = 1. To construct gi note that we use ai — 0. It results in a chain structure for gi with the 
factored form 171 = sRx{t^)x2- 

To construct the RbDD oi h ~ giRx{—iT/2)f using the apply operator, note that both gi and 
/ use s. Accordingly, one needs to apply Rule 3 which results in wi = [Rx{t^)x2\Rx{~''^ /'i)xi and 
Wq = X2Rxi~'^ /'^)x2 = X2Rx{t^ 1'^)^- Continuing this path results in /i = g2Rx{—'^/'^)hi^ g2 = sRx{t^)xi, 
and hi = xiRxi^n /2)[x2Rx{-n 

Altogether, / = [si?:j:(7r)a;2]i?J-7r/2)[[si?:r(7r)a;i]i?j;(-7r/2)[a;ii?^(7r/2)[a;2i?2.(7r/2)6]]]. The resulting 



quantum circuit is shown in Fig. 15 ^a). Note that one 0-initialized qubit is added to setup [x2i?a;(7r/2)d]. 

The 6-input multiplexer / = S1S2X1 + S1S2X2 + S1S2X3 + S1S2X4 can be constructed by using 
three 3-input multiplexers, which would require 3 extra ancillae. However, if the factor algorithm is 



directly applied, the resulting circuit only uses one ancilla as illustrated in Fig. 15 |b). For an n-qubit 
quantum multiplexer with [logn] selects and n inputs, the proposed approach leads to 2n 2-qubit 
gates and n C ^'"s R^ (tt) gates with one ancilla. As the cost of an n-qubit multiple-control Toffoli gate 
is 2n^ — 2n + 1, the proposed approach leads to 0(n log'^ n) gates, i.e., 2n -I- n(2[logn]^ — 2 [logn] -I- 1). 
We found no explicit construction for an n-qubit quantum multiplexer in the literature. However, one 
can use n Cr'°s"l+iC]\JOT gates in a circuit with one zero-initialized ancilla to implement an n-qubit 
multiplexer. Considering linear-size cost for each gate [13] leads to O(n^) costj^ | 

Quantum Fourier Transform. The quantum Fourier transform (QFT) is used in many 
quantum algorithms. QFT has an efhcient quantum circuit implementation based on R^ gates llj. 
The result of applying QFT on inputs ji, j2, ■ ■ • , Jn is |0) + e^'^^O-J" |1), |0) + e^^O J^ ':'" |1), • • • , |0) -I- 



®The construction in |14| generates a circuit with controlled-rotation gates with phase 7r/2 and total depth 6n — 2 for 
an n-qubit carry-ripple adder. We guess our circuit depth is 5n -|- 0(1). The trend lino for the number of 2-qubit gates in 
the proposed construction for n = 2, ■ • • , 15 is 4.7868n — 0.9736. 

For example, a 4-to-l multiplexer can be implemented as T(s'q,s'-^,xq, f), T(s'q, si, xi, f), T{so, s'-^^, X2, f), 
T{so, s'^, x:j, f) for a circuit with selects soi^ii inputs xq, xi, X2, x-j and output /, where / is a zero-initialized ancilla. 
For each T (Toffoli) gate, the first three lines act as the control lines and the last line acts as the target. In addition, e.g., 
T(sq, s'^, xq, f) applies xq on / when sq = 0, si = 0. This can be implemented by N(so), N(si), T{so, si, xq, f), N(si), 
N{so) where N denotes the NOT gate. 
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Figure 13: 5-bit ripple-carry adder synthesized by the proposed approach with 50 2-qubit rotation gates. 
Different gates with the same control line are represented as one gate with several targets. For example, 
the gate in the dashed box includes two rotation gates conditioned on 04 with targets on c (ancilla) and 
hi. 
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is used. 

,J„) = |0)+e2"°-^'i^'^-^""|l) 



For this function 



g27ri0.jij2 - jn where the common notation 0.jij2 ■ 

Following the discussion in Sectionjsj in the first step the output fn{ji,j2, 
is described by /„(J) = e'^^'^'> R,^{-f{J))R^{0{J)) |0) where J = ji2"-i+j22"-2 
e{J) = (7r/2)ji, and g{J) = i?,(0(J))6 = E^(7r/ 2)6. 

The RbDD for Rz{'j{J)) is shown in Fig. 16(a) where the root node is weighted. This RbDD 
corresponds to a cascade expression and there is no need to perform bi- decomp osition. The quantum 
circuit implementing /„(J) = e^^'^'^'^ Rz{'y{J))Rx{0{J)) |0) in shown in Fig. 16(b) 

The single qubit operation Rz{—tt/2) can be moved between Rx{it/2) and controlled Rz{tt) operations. 
Since ji is used as the controlled qubit of only one controlled rotation operation, the sub-circuit in Fig. 
17 [a.) can be replaced by a single qubit operator shown in Fig. [iTj^b). The 2x2 matrix describing U 
consists of two columns uq and Ui such that U ^ [ uq ui ] and can be obtained as follows: 



uo^R,i-Tr/2)R,{TT/2)\0) 



-ITT /4 



ui = i?,(^)i?,(-V2)i?.(7r/2)|0) 



V2 

i?,(V2)i?.(^/2)|0) 



-iir/4 



1 

-1 



Hence, we have: 



U = 



1 1 

1 -1 



This operator can be replaced by the Hadamard operator since the two operators differ only in a 
global phase. Therefore, the quantum circuit for |0) -I- e^"'' -'^-'^ '^" |1) can be realized as shown in Fig. 
TtJJc). The remaining outputs can be generated similarly. Accordingly, the proposed method results in 
the same circuit structure in [1 with n{n + l)/2 total gates. This can show the efficiency of the proposed 
automatic synthesis approach. | 



7 Conclusions and Further Discussion 

We mainly addressed reversible logic synthesis by quantum rotation-based gates. A new canonical 
representation model was proposed based on binary decision diagrams. Focused on it, we developed a 
synthesis framework to manipulate circuits and to synthesize functions with binary variables. We also 
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Figure 14: Circuit in Fig. 13 with depth 23. The notation LiJ represents controlled-rotation gates where 
a is the control hne and 9 is the rotation angle. 
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Figure 15: (a) Quantum 2-to-l multiplexer, (b) quantum 4-to-l multiplexer. Circuits are directly 
obtained by the factor algorithm. Only rotation angles are reported for Rx{0) gates. 



showed that the proposed approach can be extended to work with functions that generate arbitrary 
outputs for binary inputs. 

While almost all previous synthesis methods with favorable results [7] used CNOT, controUed-V and 
controlled- gates (see Fig. [l| as primitive gates with unit cost, we used 2-qubit controlled-rotation 
gates. This work can be particularly considered as a synthesis method for Boolean reversible circuits 
that computes a given Boolean function outside the Boolean domain with quantum gates [12] . We hope 
this new insight opens further analysis and investigation to efficiently address quantum and reversible 
logic synthesis possibly beyond current achievements [7]- 

To realize a given quantum computation by fault-tolerant gates, one needs to use those gates that 
have direct fault-tolerant implementations [I]. Such realizations are only available for a few operations 
such as Clifford gates. To implement a wider set of gates such as the ones we used in this paper, one 
must apply the set of fault-tolerant gates to accurately (by approximation) implement other gates. This 
can be done by the Solovay-Kitaev algorithm [T]. Given the point that the proposed approach uses 
controlled rotation gates with various angles, fault-tolerant implementation of the proposed circuits can 
be costly. Future work should address this issue. Additionally, further progress on this path may result 
in new observations to restrict /ignore angles [32l|33] and to remove redundant gates. 
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